CN2-MCI: A Two-Step Method for Constructive Induction
نویسنده
چکیده
Methods for constructive induction perform an automatic transformation of description spaces if representational shortcomings deteriorate the quality of learning. In the context of concept learning and propositional representation languages, feature construction algorithms have been developed in order to improve the accuracy and to decrease the complexity of hypotheses. Particularly, so-called hypothesis-driven constructive induction (HCI) algorithms construct new attributes based upon the analysis of induced hypotheses. Well-known HCI-systems analyze decision trees, or employ a coarse-grained analysis of decision rules. This paper introduces a new constructive operator o and documents its applicability in the usual HCI-framework. o uses a cluster algorithm to map selected features into a new binary feature. A new method for constructive induction, CN2-MCI, is described that applies o as its only constructive operator to achieve a ne-grained analysis of decision rules. The output of CN2-MCI is an inductive hypothesis expressed in terms of the transformed representation, given training examples as input. CN2-MCI is theoretically and empirically compared with existing methods for constructive induction.
منابع مشابه
Generation of Attributes for Learning Algorithms
Inductive algorithms rely strongly on their representational biases, Constructive induction can mitigate representational inadequacies. This paper introduces the notion of a relative gain measure and describes a new constructive induction algorithm (GALA) which is independent of the learning algorithm. Unlike most previous research on constructive induction, our methods are designed as preproce...
متن کاملA Wrapper Approach for Constructive Induction
Inductive algorithms rely strongly on their representational biases. Representational inadequacy can be mitigated by constructive induction. This paper introduces the notion of a relative gain measure and describes a new constructive induction algorithm (GALA) which is independent of the learning algorithm. GALA generates a small number of new boolean attributes from existing boolean, nominal o...
متن کاملMultistrategy Constructive Induction: AQ17-MCI
This paper presents a method for multistrategy constructive induction that integrates two inferential learning strategies—empirical induction and deduction, and two computational methods—data-driven and hypothesis-driven. The method generates inductive hypotheses in an iteratively modified representation space. The operators modifying the representation space are classified into "constructors,"...
متن کاملAnalysis of Example Weighting in Subgroup Discovery by Comparison of Three Algorithms on a Real-life Data Set
This paper investigates the implications of example weighting in subgroup discovery by comparing three state-of-the-art subgroup discovery algorithms, APRIORI-SD, CN2-SD, and SubgroupMiner on a real-life data set. While both APRIORI-SD and CN2-SD use example weighting in the process of subgroup discovery, SubgroupMiner does not. Moreover, APRIORI-SD uses example weighting in the post-processing...
متن کاملRule Induction with CN2: Some Recent Improvements
The CN2 algorithm induces an ordered list of classiication rules from examples using entropy as its search heuristic. In this short paper, we describe two improvements to this algorithm. Firstly, we present the use of the Laplacian error estimate as an alternative evaluation function and secondly, we show how unordered as well as ordered rules can be generated. We experimentally demonstrate sig...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1994